13 research outputs found
The Virtual Block Interface: A Flexible Alternative to the Conventional Virtual Memory Framework
Computers continue to diversify with respect to system designs, emerging
memory technologies, and application memory demands. Unfortunately, continually
adapting the conventional virtual memory framework to each possible system
configuration is challenging, and often results in performance loss or requires
non-trivial workarounds. To address these challenges, we propose a new virtual
memory framework, the Virtual Block Interface (VBI). We design VBI based on the
key idea that delegating memory management duties to hardware can reduce the
overheads and software complexity associated with virtual memory. VBI
introduces a set of variable-sized virtual blocks (VBs) to applications. Each
VB is a contiguous region of the globally-visible VBI address space, and an
application can allocate each semantically meaningful unit of information
(e.g., a data structure) in a separate VB. VBI decouples access protection from
memory allocation and address translation. While the OS controls which programs
have access to which VBs, dedicated hardware in the memory controller manages
the physical memory allocation and address translation of the VBs. This
approach enables several architectural optimizations to (1) efficiently and
flexibly cater to different and increasingly diverse system configurations, and
(2) eliminate key inefficiencies of conventional virtual memory. We demonstrate
the benefits of VBI with two important use cases: (1) reducing the overheads of
address translation (for both native execution and virtual machine
environments), as VBI reduces the number of translation requests and associated
memory accesses; and (2) two heterogeneous main memory architectures, where VBI
increases the effectiveness of managing fast memory regions. For both cases,
VBI significanttly improves performance over conventional virtual memory
POLCA: Power Oversubscription in LLM Cloud Providers
Recent innovation in large language models (LLMs), and their myriad use-cases
have rapidly driven up the compute capacity demand for datacenter GPUs. Several
cloud providers and other enterprises have made substantial plans of growth in
their datacenters to support these new workloads. One of the key bottleneck
resources in datacenters is power, and given the increasing model sizes of
LLMs, they are becoming increasingly power intensive. In this paper, we show
that there is a significant opportunity to oversubscribe power in LLM clusters.
Power oversubscription improves the power efficiency of these datacenters,
allowing more deployable servers per datacenter, and reduces the deployment
time, since building new datacenters is slow.
We extensively characterize the power consumption patterns of a variety of
LLMs and their configurations. We identify the differences between the
inference and training power consumption patterns. Based on our analysis of
these LLMs, we claim that the average and peak power utilization in LLM
clusters for inference should not be very high. Our deductions align with the
data from production LLM clusters, revealing that inference workloads offer
substantial headroom for power oversubscription. However, the stringent set of
telemetry and controls that GPUs offer in a virtualized environment, makes it
challenging to have a reliable and robust power oversubscription mechanism.
We propose POLCA, our framework for power oversubscription that is robust,
reliable, and readily deployable for GPU clusters. Using open-source models to
replicate the power patterns observed in production, we simulate POLCA and
demonstrate that we can deploy 30% more servers in the same GPU cluster for
inference, with minimal performance los
Hybrid Computing for Interactive Datacenter Applications
Field-Programmable Gate Arrays (FPGAs) are more energy efficient and cost
effective than CPUs for a wide variety of datacenter applications. Yet, for
latency-sensitive and bursty workloads, this advantage can be difficult to
harness due to high FPGA spin-up costs. We propose that a hybrid FPGA and CPU
computing framework can harness the energy efficiency benefits of FPGAs for
such workloads at reasonable cost. Our key insight is to use FPGAs for
stable-state workload and CPUs for short-term workload bursts. Using this
insight, we design Spork, a lightweight hybrid scheduler that can realize these
energy efficiency and cost benefits in practice. Depending on the desired
objective, Spork can trade off energy efficiency for cost reduction and vice
versa. It is parameterized with key differences between FPGAs and CPUs in terms
of power draw, performance, cost, and spin-up latency. We vary this parameter
space and analyze various application and worker configurations on production
and synthetic traces. Our evaluation of cloud workloads shows that
energy-optimized Spork is not only more energy efficient but it is also cheaper
than homogeneous platforms--for short application requests with tight
deadlines, it is 1.53x more energy efficient and 2.14x cheaper than using only
FPGAs. Relative to an idealized version of an existing cost-optimized hybrid
scheduler, energy-optimized Spork provides 1.2-2.4x higher energy efficiency at
comparable cost, while cost-optimized Spork provides 1.1-2x higher energy
efficiency at 1.06-1.2x lower cost.Comment: 13 page
A Stress Induced Source of Phonon Bursts and Quasiparticle Poisoning
The performance of superconducting qubits is degraded by a poorly
characterized set of energy sources breaking the Cooper pairs responsible for
superconductivity, creating a condition often called "quasiparticle poisoning."
Recently, a superconductor with one of the lowest average quasiparticle
densities ever measured exhibited quasiparticles primarily produced in bursts
which decreased in rate with time after cooldown. Similarly, several cryogenic
calorimeters used to search for dark matter have also observed an unknown
source of low-energy phonon bursts that decrease in rate with time after
cooldown. Here, we show that a silicon crystal glued to its holder exhibits a
rate of low-energy phonon events that is more than two orders of magnitude
larger than in a functionally identical crystal suspended from its holder in a
low-stress state. The excess phonon event rate in the glued crystal decreases
with time since cooldown, consistent with a source of phonon bursts which
contributes to quasiparticle poisoning in quantum circuits and the low-energy
events observed in cryogenic calorimeters. We argue that relaxation of
thermally induced stress between the glue and crystal is the source of these
events, and conclude that stress relaxation contributes to quasiparticle
poisoning in superconducting qubits and the athermal phonon background in a
broad class of rare-event searches.Comment: 13 pages, 6 figures. W. A. Page and R. K. Romani contributed equally
to this work. Correspondence should be addressed to R. K. Roman
LEARN: A multi-centre, cross-sectional evaluation of Urology teaching in UK medical schools
OBJECTIVE: To evaluate the status of UK undergraduate urology teaching against the British Association of Urological Surgeons (BAUS) Undergraduate Syllabus for Urology. Secondary objectives included evaluating the type and quantity of teaching provided, the reported performance rate of General Medical Council (GMC)-mandated urological procedures, and the proportion of undergraduates considering urology as a career. MATERIALS AND METHODS: LEARN was a national multicentre cross-sectional study. Year 2 to Year 5 medical students and FY1 doctors were invited to complete a survey between 3rd October and 20th December 2020, retrospectively assessing the urology teaching received to date. Results are reported according to the Checklist for Reporting Results of Internet E-Surveys (CHERRIES). RESULTS: 7,063/8,346 (84.6%) responses from all 39 UK medical schools were included; 1,127/7,063 (16.0%) were from Foundation Year (FY) 1 doctors, who reported that the most frequently taught topics in undergraduate training were on urinary tract infection (96.5%), acute kidney injury (95.9%) and haematuria (94.4%). The most infrequently taught topics were male urinary incontinence (59.4%), male infertility (52.4%) and erectile dysfunction (43.8%). Male and female catheterisation on patients as undergraduates was performed by 92.1% and 73.0% of FY1 doctors respectively, and 16.9% had considered a career in urology. Theory based teaching was mainly prevalent in the early years of medical school, with clinical skills teaching, and clinical placements in the later years of medical school. 20.1% of FY1 doctors reported no undergraduate clinical attachment in urology. CONCLUSION: LEARN is the largest ever evaluation of undergraduate urology teaching. In the UK, teaching seemed satisfactory as evaluated by the BAUS undergraduate syllabus. However, many students report having no clinical attachments in Urology and some newly qualified doctors report never having inserted a catheter, which is a GMC mandated requirement. We recommend a greater emphasis on undergraduate clinical exposure to urology and stricter adherence to GMC mandated procedures
Multi objective dynamic optimization study of butylated urea formaldehyde resin process in a batch reactor
Butylated urea formaldehyde (BUF) is a key intermediate for manufacturing paint and coating. The quality of BUF resins can be measured in terms of the concentration of free formaldehyde in the BUF resins and the extent of butylation. We in this work present an optimal control study to obtain minimum free formaldehyde concentration and minimum butanol concentration at the end of the batch operation. Reactor temperature is used as the manipulated variable and optimum temporal reactor temperature profiles are obtained using control vector parameterization approach. The two aforementioned criteria are observed to be mutually conflicting and hence the multi-objective optimal control problem is solved in this work yielding the pareto optimal curve showing the trade-off solutions of the MOO problem. Such pareto optimal curve helps the operator to choose an operating condition for a desired operation.by Garima Patel, Shital Amin, Nitin Padhiyar and Pratyush Daya
CURRENT ASSESSMENTS REGARDING THE PATHOGENESIS AND TREATMENT STRATEGIES OF ORAL LICHEN PLANUS – A REVIEW
Oral Lichen planus (OLP) is an autoimmune chronic inflammatory disease of mucous membrane. It is mostly CD8+T-cell mediated autoimmune response with unknown etiology and pathogenesis. It generally affects approximately 1% to 2% of the world’s population. OLP affects women more than men at a ratio of approximately 1.4:1. The prevalence of OLP ranges between 0.5% and 3% and in Indian populations it is 2.6%. In recent years, many possible causes regarding the pathogenesis of OLP have been suggested, the exact nature is still unclear. Most data suggests that some specific antigen and nonspecific mechanism are involved. Antigen presentation by basal keratinocytes and antigen-specific keratinocyte killing by CD8+ cytotoxic T-cells are said to be antigen-specific mechanisms. It is still not clear whether antigen is exogenous or endogenous in origin and what specific antigen is responsible for triggering the inflammatory responses. Mast cell degranulation and activation of matrix metalloproteinase (MMP) specific mechanisms. This paper explains how these two mechanism work together and also the current understanding regarding other factors which are responsible for its pathogenesis
Co-production of syngas and potassium-based fertilizer by solar-driven thermochemical conversion of crop residues
We report on the thermochemical conversion of inedible crop residues using concentrated solar energy as the source of high-temperature process heat. Experiments were performed using a 5 kWth solar packed-bed reactor exposed to radiative fluxes up to 1788 suns. The waste biomass feedstock consisted of unprocessed batches of cotton boll, soybean husk, and black mustard husk and straw, which were pyrolysed and steam-based gasified at nominal temperatures in the range 879–1266 °C, yielding high-quality syngas with molar ratios in the range H2:CO = 1.43–3.25, CO2:CO = 0.28–1.40, and CH4:CO = 0.03–0.28. The solar-to-fuel energy conversion efficiency, defined as the ratio of the heating value of the syngas produced to the solar radiative energy input and the heating value of the feedstock, reached 18%. The heating value of the feedstock was solar-upgraded by 7%, thus outperforming autothermal gasification that typically downgrades by at least 15%. The ash contained 23% potassium. The solar-driven thermochemical process offers a sustainable and efficient path for the conversion of agricultural wastes into valuable fuels and soil fertilizers.ISSN:0378-382